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ABSTRACT 



Various methods of caching web resources include caching 
in accordance with a number of times accessed, a frequency 
of access, or a duration of access. One method of caching 
web resources includes the step of accessing a first web 
resource. The first web resource is cached, if no other web 
resource is accessed after a pre -determined period of time. 
Another method of caching web resources includes the step 
of accessing a first web resource. The first web resource is 
cached, if the first web resource is subsequently accessed 
more than a pre-determined number of times. Another 
method of caching web resources includes the step of 
accessing a plurality of web resources. The accessed web 
resources are cached as cached web resources in accordance 
with at least one of a number of times accessed, a frequency 
of access, or a duration of access. An apparatus comprises 
storage media containing caching logic for caching web 
resources. The caching logic includes instructions to cache 
selected web resources as cached web resources in accor- 
dance with at least one of a number of times accessed, a 
frequency of access, or a duration of access. The selected 
web resources correspond to a subset of a plurality of 
accessed web resources. 

3 Claims, 8 Drawing Sheets 
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METHOD OF CACHING WEB RESOURCES 

This application is a continuation of application Ser. No. 
08/918,726 that was filed on Aug. 21, 1997, now abandoned. 

FIELD OF THE INVENTION 

This invention relates to the field of computers. In 
particular, this invention is drawn to caching web resources 
obtained from the Internet. 

BACKGROUND OF THE INVENTION 

The Internet is a worldwide network of computers and 
computer resources sharing a common communication pro- 
tocol to facilitate the communication of information between 
computers that may have different hardware and software 
architectures including different operating systems and file 
systems. 

The Internet uses a client/server architecture for exchang- 
ing information. A client computer requests resources from 
one or more host computers somewhere on the Internet. 
These resources may include files or services such as 
information retrieval services, search services, communica- 
tion services, and recreational services. 

A subset of Internet host computers provide multimedia 
information services. This subset of host computers support 
a protocol which permits sharing hypermedia documents 
between computers having different architectures, operating 
systems, and file systems. These hypermedia documents can 
be viewed or accessed using a "browser" application pro- 
gram on the client computers. 

The World Wide Web is a wide-area hypermedia infor- 
mation retrieval initiative aiming to give universal access to 
a large universe of information stored in computers that use 
different hardware and software architectures. Hypermedia 
is similar to multimedia except that hypermedia contains 
"hyperlinks" or simply "links" to other information includ- 
ing text, sounds, images, movies, etc. The information may 
be embodied in the form of a document. The "web" is 
virtually formed by these hyperlinks. Thus the "web" refers 
to a body of information or abstract space of knowledge. 
Physical access to this body of information is frequently 
accomplished using the Internet. A hypermedia document 
viewed using a browser is often referred to as a "web page." 

Web pages often contain links to other web resources. For 
example, a web page may contain a link to another position 
within the same web page, another web page located some- 
where else on the Internet, or links to initiate other services 
such as searching or file retrieval. These other web 
resources, for example, can be files designed to produce 
sounds or movies. The link destination is identified by a 
Uniform Resource Locator (URL). Every WWW resource 
located anywhere on the Internet can be identified by a URL. 

Using these links within documents, a user may navigate 
to various web pages located throughout the Internet by 
following a chain of links. After following a chain of links, 
a user may need to return to a common page to select another 
chain of links to follow. This may be the case when 
exploring a web site or when searching the World Wide Web 
for sites related to a given topic. 

For example, when a client computer issues a search 
request to a host computer, the host may respond by creating 
a web page containing links to sites containing information 
relevant to the user's query. This web page is referred to as 
the common page. The user may choose to follow a given 
link from the common page. After reviewing the web page 
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associated with the selected link, the user can typically 
return to the common page to pursue another link. For 
example, the user may return to the common page in order 
to pursue links to other web pages that were indicated as 

5 being relevant to a search request. 

One disadvantage of current browsers is the lack of 
sophisticated heuristics for caching web pages and other 
resources. In one caching scheme the browser caches web 
pages on a first-in-first out basis. The number of web pages 

10 cached is dependent upon the resources (i.e., storage 
requirements) of each web page as well as the amount of 
storage set aside for caching web pages. If the common page 
is not cached, the web browser must access the host's search 
engine again in order to regenerate the common page. 

15 This caching technique has a number of disadvantages. 
One disadvantage of this technique is the inefficient use of 
the host machine resources. In particular, the client computer 
wastes host computer time by requesting the host computer 
to regenerate the common web page each time it is accessed. 

20 Another disadvantage of regenerating the common page is 
the time wasted by the client waiting for regeneration and 
retrieval of the common web page. Yet another disadvantage 
is the increased Internet traffic associated with reissuing the 
saved URL and retrieving the generated common page from 

25 the host machine's search engine. 

SUMMARY OF THE INVENTION 

A method of caching web resources includes the step of 
30 accessing a first web resource. The first web resource is 
cached, if no other web resource is accessed after a pre- 
determined period of time. 

Another method of caching web resources includes the 
step of accessing a first web resource. The first web resource 
35 is cached, if the first web resource is subsequently accessed 
more than a pre -determined number of times. 

Another method of caching web resources includes the 
step of accessing a plurality of web resources. The accessed 
web resources are cached as cached web resources in 
40 accordance with at least one of a number of times accessed, 
a frequency of access, or a duration of access. 

An apparatus includes storage media containing caching 
logic for caching web resources. The caching logic includes 
instructions to cache selected web resources as cached web 
45 resources in accordance with at least one of a number of 
times accessed, a frequency of access, or a duration of 
access. The selected web resources correspond to a subset of 
a plurality of accessed web resources. 

Other features and advantages of the present invention 
50 will be apparent from the accompanying drawings and from 
the detailed description that follows below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example 
55 and not limitation in the figures of the accompanying 
drawings, in which like references indicate similar elements 
and in which: 

FIG. 1 illustrates a client-server architecture networked 
6q computer environment. 

FIG. 2 illustrates one embodiment of a client computer. 
FIG. 3 illustrates a web page and hyperlinks to other web 
resources. 

FIG. 4 illustrates navigation within a web page having 
65 hyperlinks to itself. 

FIG. 5 illustrates one embodiment of a method of caching 
web resources such as a web page. 
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FIG. 6 illustrates an embodiment of another method of A subset of Internet host computers provide multimedia 

caching web resources such as a web page. information services. This subset of host computers permit 

FIG. 7 illustrates a method of caching a first web resource physical access to the abstract body of information referred 

such as a web page after a request to access a second web to as the World Wide Web (WWW) and are referred to as 

resource is received. 5 WWW hosts or WWW servers. 

FIG. 8 illustrates retrieval of cached web resources such World Wide Web host computers support a protocol 

as web pages. which permits users having computers with different 

DETAILED DESCRIPTION architectures, operating systems, and application programs 

to share multimedia enhanced documents. In one 

FIG. 1 illustrates a network environment 100 including a 10 embodiment, this protocol is the Hypertext Transport Pro- 
communication network commonly referred to as the Inter- tocol ("HTTP"). The multimedia enhanced documents are 
net 150. The Internet includes a combination of routers, 0 ft en referred to as "web pages." The application specific to 
repeaters, gateways, bridges, and communications links with a g i ve n hardware platform that permits viewing the web 
computers spread throughout the world. The Internet facili- pages ^ 0 ft e n referred to as a browser, 
tales conQmunication between computers or other devices is Hypermedia enhanced documents typically provide links 
connected to the Internet. ^ hyperlinks ) t0 other res0 urces in response to selection 

Some of the computers are referred to as host computers of m anchor nc anchor fe typicaU y a word> group 0 f words, 

because they provide services upon request. The computers or demarcated area of the document. For 

issuing the requests are referred to as client computers. graphical-based browsers, the anchor is generally activated 

Network environment 100 includes multiple (N) client 20 by pomt j n g t0 me anchor and clicking a selection button of 

computers (110 thru 120) and multiple (M) host computers a pointing device such as a mouse. For text-based browsers, 

(130 thru 140). t he an chor is generally activated by using cursor-control 

In one embodiment, computers may use an Internet Ser- keys to select the anchor and pressing another key (e.g., 

vice Provider (ISP) to gain access to the Internet. For <ENTER>). These anchors and links are defined using a 

example, clients 110 and 112 use ISP(l) 160 to access the 25 language such as hypertext markup language ("HTML"). 

Internet. Client 120 uses ISP(X) 170 to access the Internet. The browser interprets an HTML file to display the web 

Host computers such as hosts 132 and 140, for example, page. The HTML file also determines what resource is 

can be directly connected to the Internet. Alternatively, a accessed in response to activation of a given anchor, 

host computer (e.g., 130) may use an ISP (e.g., 162) to gain 3o Uniform Resource Locators (URLs) provide a standard 

access to the Internet. wav Q f referencing Internet resources including web 

The host computers (e.g., 130) and client computers (e.g., resources regardless of whether the resource is a document, 

110) can be entirely different architectures, however, they an image, a movie, a sound, or another web page. A browser, 

can communicate by using a common communication pro- for example, can access a host machine identified by the 

tocol. In one embodiment, this protocol is the Transmission 35 URL and then retrieve the resource specified by the URL. A 

Control Protocol/Internet Protocol (TCP/IP). uniform resource locator (URL) is a description of an item 

FIG, 2 illustrates one embodiment of a client computer including the location of the item that is to be retrieved. For 

architecture. Client computer 200 includes processor 210. example, the location might be the local disk drive or 

Input devices such as mouse 220 and keyboard 230 permit another file at another Internet site. The URL is not limited 

the user to input data to client computer 200. Information 4Q to other World Wide Web sites and may in fact refer to other 

generated by the processor is provided to an output device Internet protocols and services such as Gopher, WAIS, 

such as display 240, Client computer 200 typically includes UseNet news, Telnet, or anonymous FTP (file transfer 

random access memory (RAM) 260 which is used by the protocol). 

processor for executing application programs. a URL identifies the protocol as well as the location of the 

In one embodiment, client computer 200 includes non- 45 item to be retrieved. For example, consider the following 

volatile memory 270. Nonvolatile memory 270 is used to URL: 
retain programs and data even when the client computer is 
powered down. In one embodiment nonvolatile memory 270 

is an electromechanical hard drive. In other embodiments, http://www.intcl.com/intei/produci/bdex.htmi 

nonvolatile memory 270 might be a solid state disk drive or 50 -j^ tjr L identifies the protocol as "http" ("Hypertext 

a flash electrically erasable programmable read only Transport Protocol"). Other protocols include "gopher" (to 

memory (flash EEPROM). a Gopher session), "ftp" (to initiate a file transfer), 

Mouse 220, keyboard 230, RAM 260, and nonvolatile «fii e » ( t0 retrieve a local file), "wais" (for accessing a WAIS 

memory 270 are typically communicatively coupled to server), "news" (for reading UseNet newsgroups), and "tel- 

processor 210 through one or more address and data buses 55 ne t" for initiating a Telnet session. 

such as bus 250. The portion "www.intel.com" is an Internet host address 

The client computer can request services from a host or symbolic representation of an Internet host address. Thus 

computer. Hosts typically provide services such as file "www.intel.com" identifies a specific host. The portion of 

retrieval services, search services, communication services, the URL identifying the specific host is often referred to as 

and recreational services. 60 a web site. 

For example, Gopher provides information retrieval ser- The remainder is a UNIX-style path for the resource that 

vices. Popular search services include Archie. Veronica is is being accessed. Thus "intel/product/index.html" instructs 

the name of another search service typically used in con- a browser to retrieve the file index.html from the directory 

junction with Wide Area Information Servers (WAIS). Com- path "intel/product" on the Internet host, 

munication services include electronic mail, UseNet, Telnet, 65 Often a user may simply provide the protocol and Internet 

and Internet Relay Chat (IRC). An example of a recreational host address such as "http://www.intel.com" without explic- 

service is a Multi-User Dungeon (MUD). itly identifying a specific resource. The server software 



03/16/2004, EAST Version: 1.4.1 



us 6,5: 

5 

executing on the host machine, however, provides a default 
resource in response to such a request. In one embodiment, 
this might be a file named "index.html" or "index.htm" for 
machines having file systems that do not support longer 
filename extensions. Thus the use of the web site alone 
implicitly identifies a default resource. For web pages, this 
default resource is often referred to as the "home page." 
Alternatively, the home page may be explicitly defined. 
Thus the URL either explicitly or implicitly identifies a 
specific resource, protocol, and host machine on the Internet. 

A given web page may be formed from a number of 
resources identified within the web page's html file. For 
example, "index. html" contains instructions for generating a 
hypermedia document for the client. The client's browser 
interprets the html file in order to produce the corresponding 
web page. The html file may instruct the client's browser to 
access any number of other web resources located on the 
host or elsewhere including bitmap files, files for producing 
animated video or sound, etc. Each of these resources is 
identified either explicitly or implicitly by a URL. 

FIG. 3 illustrates one example of using URLs to access 
other Internet resources. Web page 300 includes anchors 302 
and 304 for accessing other web resources. Anchors are 
often identified by highlighted text, an icon, or an image on 
the web page. In one embodiment, the web page is defined 
using hypertext markup language ("HTML") to describe the 
text, icons, images, etc. of the web page. 

The highlighted text, icon, or image is associated with a 
URL. This association is the hyperlink. Often the terms 
"hyperlink" or "link" are used synonymously with the term 
"anchor." Thus "anchor 302" is equivalent to "link 302," or 
"hyperlink 302." 

Selecting an anchor effectively selects and follows the 
hyperlink to the resource identified by the associated URL. 
Selecting anchors 302 or 304, for example, will access other 
Internet resources that may be physically located in various 
host computers throughout the Internet. The selection is 
typically accomplished with a pointing device such as a 
mouse. 

For some applications (i.e., searching) users often re-visit 
web pages previously visited. For example, the user may 
have used an Internet search engine to generate a web page 
containing the results of a search. This resulting web page 
may contain a number of links to other web pages. Users 
typically select links one at a time from the resulting web 
page. After following a given link, the user returns to the 
resulting web page in order to follow another link suggested 
by the search engine. Thus the web page resulting from the 
search may be visited a number of times. 

Referring to FIG. 3, web page 300 provides hyperlinks 
generated as a result of a search. Web page 310 is accessed 
by selecting hyperlink 302. From web page 310, the user can 
access web page 320 by selecting anchors 314 or 312. 
Anchor 312 is an example of using an image or an icon as 
an anchor. From web page 320, the user can select anchor 
322 to return to web page 310. Thus the user can follow or 
navigate a chain of links by selecting the anchors. 

After perusing web pages 310 and 320, the user can return 
to web page 300 in order to follow another link identified by 
the search results. For example, the user can return to the 
search results web page 300 in order to follow the chain of 
links starting with hyperlink 304. Selection of anchor 304 
accesses web page 330. From web page 330, the user can 
access web page 340 by selecting anchor 336. The user can 
return to web page 330 by selecting anchor 342 on web page 
340. Web page 330 also contains hyperlinks to web page 
310, Thus the user can access web page 310 by selecting 
anchor 332. 



16,479 B2 

6 

Web page 300 may be referred to as the "common page." 
Web page 300 is the common link to all of the other links 
contained within web page 300. For example, web page 300 
provides access to the chain of links beginning with link 302 

5 and the chain of links beginning with 304 regardless of 
whether or how those chains might subsequently intersect. 

In order to facilitate following links, browsers typically 
save the URL of the most recently accessed web pages. The 
user can navigate "forward" and "backward" among their 

10 most recently selected web pages even if the web pages do 
not contain specific links to each other. Thus for example, 
web page 330 provides a link to web page 310, but the 
converse is not true. The browser provides means to navi- 
gate "backward" and "forward" among the most recently 

15 selected web pages. Thus, for example, if web page 310 was 
accessed from web page 330, the user can return to web page 
330 by navigating backward using features of the browser. 
This type of approach may be necessary in order to return to 
search page 300 after following a chain of links. 

20 FIG. 4 illustrates an example of following hyperlinks to 
another location within the same web page. Web page 400 
can only be viewed a portion at a time due to the actual size 
of the display as indicated by outline 420. In addition to text, 
web page 400 includes anchors 440 and 450 to other 

25 resources. 

In the illustrated embodiment, anchor 440 is an icon that 
indicates that selection of anchor 440 will access a movie. In 
one embodiment, the selection of anchor 440 accesses a file 
identified by an associated URL. The browser recognizes the 
30 file as containing data for generating a movie. The browser 
either plays the movie in accordance with the data or 
provides the file to another application in order to play the 
movie. 

Anchor 450 is an icon that indicates that selection of 
35 anchor 450 will access a sound. In one embodiment, the 
selection of anchor 450 accesses a file identified by an 
associated URL. The browser recognizes the file as contain- 
ing data for generating sound. The browser either interprets 
the data to generate the sound or provides the file to another 
40 application in order to generate the sound. 

Web page 400 also provides an index to other locations 
within the same web page. Selecting anchor 410, for 
example, repositions the web page 400 within the viewing 
area of the display so that the user might see view 460. Due 
45 to the physical limitations of the displayable area, the 
browser typically provides a method of scrolling (e.g., scroll 
bars) so that the user may navigate within the same web page 
by scrolling. 

The process of accessing a host machine and retrieving 
50 the specified resource in accordance with the URL may take 
a considerable amount of time. In an effort to decrease the 
time spent waiting to download web pages, browsers typi- 
cally have a cache feature for caching web pages. When a 
URL specifies a web page that has previously been cached, 
55 the browser may retrieve the cached version of the web page 
instead of accessing the host machine again when re-visiting 
the web page. 

One caching mechanism attempts to cache every web 
page accessed. Web pages are cached on a first-in-first -out 

60 basis so that the most recently accessed non-cached web 
page can be cached. 

One disadvantage of this method is inefficient usage of the 
cache may result in excessive storage set aside for the cache. 
For example, web pages that are not likely to be re-visited 

65 are cached. Due to size constraints of the cache, frequently 
re-visited web pages or web pages with a high likelihood of 
being re -visited, such as common pages, might be flushed 
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from the cache in order to permit caching the most recently In an alternative embodiment, the table and cache are 

accessed web page. In order to provide a greater likelihood erased and re-initialized when requested by the user. Thus 

that frequently visited web pages remain within the cache, web resources may be cached based upon the total number 

the cache size may be increased. Increasing the cache size, 0 f times accessed independent of the number of browsing 

however, consumes computer resources that could otherwise 5 sessions. 

be allocated for other purposes. In yet another embodiment, the table provides for tracking 

FIG. 5 illustrates a method of cach.ng web resources the re , ative of ^ ajdM web , f ^ ^ 

based on the length of tune spent browsing the web resource. , _ . , , . *u t. t. ^ j • j j 

i *u ~~u~a~Z * n * * a ' ot/-" c *L u * has remained m the cache beyond a pre -determined period 

In the embodiment illustrated in FIG. 5, the web resource is - 4 . iL 4 . , c K , 

a web page 0 ' * e entry * or tnat resource mav be 

The method of caching web resources begins with step 10 "validated and the resource may be flushed from the cache. 

510. A first web page is accessed in step 520. If no other web relative a 8 e ma y be tracked m the table b y providing an 

page is selected after a pre-determined period of time as additional field and storing the date and time that the 

determined in step 530, then the first web page is cached. corresponding resource was cached. Alternatively, the field 

Otherwise the first web page is not cached and processing is stores a value that permits determining in what order the 

completed in step 590. 15 resources were cached. 

In one embodiment, the first web page is automatically Some browsers "cache" resources as they are being 

cached after the pre-determined time period has elapsed, if downloaded from a host. The methods described above may 

no other web page is selected. be applied to determine when and if resources should be 

In an alternative embodiment, the first web page is cached flushed from the cache as opposed to whether they are 

only when a second web page has been selected after a 20 cached. In other words, the methods described above can be 

pre-determined time period. use( j to determine the persistence of resources stored in the 

FIG. 6 illustrates another method for caching web cache, 

resources based on the number of visits to a given web For example, consider a browser that caches resources as 

resource. In this embodiment the web resource is a web they m downloaded. In one embodiment, the methods 

P a S e * 25 described above are applied to determine which resources 

The method of caching web pages begins with step 610. should remain m the cache wheD a triggermg event initiates 

A first web page is accessed in step 620. If the same web flushing of at least a portion of the cache, 

page was previously 'accessed ^a pre-determined number of , n 0Qe embod iment, a triggering event is when the 

times as determined by step 630, then the web page * cached brQWSer ^ Iq anQther embodiment) exiti the browser 

m step 640 and processing * completed in step 690. Thus 30 a licalion is a trig g e ring event. In another embodiment, 

after the initial access, the first web page is cached if it is flushin of at ^ a ioQ of ^ cacfae fa iniliated wheQ 

subsequently accessed more than a pre-determmed number lhe cache feaches a pre . de termined size. In yet another 

o times. embodiment, flushing of at least a portion of the cache is 

Various techniques may be used to track the number of initiated whenever the cache has less thaD a ^-determined 

times a particular web resource is accessed. For example, a 35 nt of ava ii ab i e spa ce left, 
table may be used. The table entries are indexed by a unique 

, u , t 1* ** ttt>t c u i» Thus caching: in the context of cache persistence means 

key such as the complete, explicit URL for each web „ . , & . A , , \ .. A . - lL 

J , r r • , lU , , , ■ allowing the resource to stay in the cache. Application of the 

resource. When a web resource is accessed, the table is 1 . * 

scanned to determine if that web resource already has a methods described above to cache persistence permits 

corresponding entry in the table by searching for that web 40 "H*™* the mles descnbed above across multl P le browser 

resource's URL in the table. If the web resource has not sessions. 

previously been accessed, then a new entry in the table is For example, a resource that is accessed infrequently 

created using that web resource's URL. If the web resource durm 6 a 6 iven br owser session may persist in the cache 

has previously been accessed (as determined by scanning the under lhe total number of accesses rule, if a sufficient 

table) then a count field associated with that web resource's 45 number of accesses occur before a triggering event flushes 

URL in the table is incremented to reflect another access has the resource, even if the accesses are accumulated over more 

been made. tnan one Drow ser session. In one embodiment, the count 

In one embodiment, a resource such as an html file might field associated with the URL of a given resource is incre, 

instruct the browser to generate a web page from a number rnented each time the given resource is accessed. If the value 

of other resources. In such a case, an entry is created in the 50 m the count field fe less than a pre-determined value when 

table for each web resource accessed. the triggering event occurs, then the resource is flushed from 

In one embodiment, the count field is incremented only if me cacne - 

the subsequent accesses are interleaved with accesses to Thus one method of persistence caching includes the step 

other web resources. Thus if the web resource being tracked of caching at least one web resource of a plurality of web 

is a web page, consecutive accesses that merely reposition 55 resources accessed during a plurality of browser sessions. A 

the web page within the display do not increase the count. count of the number of accesses to each cached web resource 

Visiting another web page and subsequently returning to the is computed. Selected cached web resources are flushed in 

first web page will, however, increase the count. response to a triggering event, if the corresponding count is 

In one embodiment, the table and cache are erased and less than a pre-determined value, 

re-initialized upon starting the browser application. Thus 60 Similarly, the table may include a "session" field to aid in 

web resources are cached based upon the number of times tracking an average frequency of access across multiple 

accessed within a given browsing session. If this is the case, browser sessions. Thus each time the browser is started up, 

then the table is effectively a frequency table. In other words, the session field is incremented for all URL entries. The 

the web resource is cached if the same resource is accessed count field is incremented for a given URL whenever access 

more than a predetermined number of times within a given 65 is made to the resource identified by the given URL. When 

browsing session. Thus the web resource is cached based a triggering event occurs, the average frequency of access 

upon frequency of access. can be calculated 
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residing on local area network 140 may cache the web 

count x resources on local area network server 144. 

sessions/ FIGS. 5 and 6 effectively indicate conditions that deter- 
mine if an accessed web resource should be cached, but not 

for each URL to determine if the resource identified by the 5 when the accessed web resource should be flushed, 

given URL should remain in the cache or be flushed. If the Accessed web resources can be flushed as the need arises, if 

average frequency of access for a given URL is less than a tnev are not cached or have not been selected for caching, 

pre -determined value at the time a triggering event occurs, Eventually, even cached web resources may need to be 

then the identified resource is flushed. flushed in order to permit caching additional web resources. 

Thus another method of persistence caching includes the io In one embodiment, cached web resources are flushed on a 

step of caching at least one web resource of a plurality of first-in, first-out basis. In this embodiment, the use of a 

web resources accessed during a plurality of browser ses- relative age field, as described above* permits tracking the 

sions. An access count of the number of accesses to each order in which web resources were cached for purposes of 

cached web resource is computed. A session count of the determining the order in which they should be flushed, 

number of sessions a selected web resource is accessed is 15 Thus methods of caching web resources based on duration 

computed. Selected cached web resource are flushed in 0 f access, number of times accessed, and frequency of 

response to a triggering event, if the corresponding average access have been described. Various embodiments may 

frequency of access determined by dividing the correspond- combine these metnods of caching . In p art icular, the browser 

ing access count by the corresponding session count is less may ^ configured to apply caching ^ io accordance with 

T^e d"h^ may also be used in the cache 20 lhe type ° f reSOUrCC - For CXam P le ' ^ files ^ be 

peSe^ V^*^.™ ^ext files might be asso- 

"duration" field. When accessing a given resource, the ™ ted ™* a dlffer ^ nt c u ach u m S mle * In addition once the 

length of time spent without accessing any other resource is determination is made whether a resource should be cached, 

measured. This length of time is then added to the duration the ^source may be cached at that time or in response to a 

field. The duration field thus indicates the accumulated time 25 rec l uesl to access another resource. 

spent on the given resource. When a triggering event occurs, FIG - 7 » for example, combines the methods presented 

the given resource is flushed unless the accumulated time in above for determining if a web resource is selected for 

the duration field exceeds a pre-determined value. caching. The web resource is not cached, however, until a 

Thus a duration of access for persistence caching includes request to access another web resource is received. The 

the step of caching at least one web resource of a plurality 30 method begins with step 710 using web pages as examples 

of web resources accessed during a plurality of browser of web resources. 

sessions. A length of time spent without accessing any other In step 720, a first web page is accessed. In the embodi- 

resource is accumulated for each cached web resource. ment illustrated in FIG. 7, this first web resource or web 

Selected cached web resource are flushed in response to a page is not presently being cached, 

triggering event, if the corresponding accumulated length of 35 In step 730, a request to access a second web page is 

time is less than a pre-determined value. received. In this context, the second web page is a web 

In one embodiment, regardless of the caching rule resource other than the first web page. Thus selecting an 

applied, the table entry identifying a given resource is anchor on the first web page in order to reposition the first 

deleted or marked invalid once the given resource is flushed. web within the display is not considered to be receiving a 

Referring to the client computer architecture illustrated in 40 request to access another web resource or a "second" web 

FIG, 2, various storage media may be used for caching the page. 

web resources. For example, in one embodiment, the web After a request to access a second web page is received, 

resource is cached in RAM 260. the method makes the determination of whether the first web 

In an alternative embodiment, the web resource is cached resource should be cached in steps 740 and 750. In 

using nonvolatile storage such as nonvolatile memory 270. 45 particular, if the first web page was previously accessed a 

In one embodiment, nonvolatile memory 270 is a solid state pre-determined number of times, as determined in step 740, 

disk. In another embodiment, nonvolatile memory 270 is then the first web page is cached in step 760. Alternatively, 

flash electrically erasable programmable read only memory. if a pre-determined time period has elapsed since accessing 

When a web resource is cached it must be mapped to its the first web page as determined in step 750, then the first 

corresponding full URL. Otherwise a "name collision" may 50 web page is cached in step 760. 

occur, for example, if resources with the same name (e.g., The requested resource, (i.e., the second web page) is not 

" index. html") are downloaded from different sites. accessed until step 770. Thus the second web page is not 

This can be accomplished by caching the web resource accessed until either 1) a decision is made not to cache the 

and identifying it directly by its full URL. In one first web page, or 2) a determination is made to cache the 

embodiment, the full URL is expressed as a property of the 55 first web page and the first web page is cached, 

file. Thus the file can be renamed to avoid name collisions, FIG. 7 thus combines the methods of caching based on a 

but identified for subsequent retrieval by examining the file duration of access and a number of times the resource is 

properties of the cached files, requested. The resource is not cached until a request to 

In another embodiment, each entry of the caching table access another resource is received, 

includes a field to identify the name of the resource as 60 FIG. 8 illustrates one embodiment of a method of retriev- 

cached. Thus if the resource must be renamed due to a name ing a cached web resource. In step 810 a request is made to 

collision, the name as cached is readily available from the access a web resource, in particular a web page. Step 820 

same table that indicates whether the resource was cached. determines if the requested web page is in the cache. If the 

In another embodiment, a separate mapping table can be web page is not in the cache, then the web resource is 

used to map files as cached to their corresponding full URLs. 65 retrieved from the Internet in step 840. If; however, the web 

The web resource need not be cached locally on the client page is cached, then the browser retrieves the cached version 

computer. Referring to FIG. 2, for example, client 136 of the web page instead of the actual web resource in step 
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830, Various embodiments may permit the user to "refresh" 
the web page by retrieving it again from the host computer 
instead of retrieving the cached version of the resource. 

In one embodiment, the methods described above are 
implemented as a series of instructions within a program. 
The program may be implemented in software, firmware, or 
hardware. 

The firmware or hardware implementations may be par- 
ticularly suited for systems designed specifically to support 
Internet tools and applications, and communication proto- 
cols. These machines are referred to as NC ("network 
computer") machines. 

One example of such a machine is the NC manufactured 
by Oracle Corporation, Redwood Shores, Calif. The NC 
machines achieve hardware costs lower than that of a typical 
personal computer by offering few capabilities other than the 
ability to "browse" the Internet. As a result, the machines 
typically do not have standard peripherals such as floppy 
disk drives, hard disk drives, or optical disk drives, etc. 
Other resources such as memory may be somewhat less than 
that found on a more versatile typical personal computer. 

The methods described above may be particularly suited 
for machines with such limited resources. The instructions 
for implementing the methods described above may be 
stored within ROM, flash memory, or some other nonvola- 
tile memory within the NC hardware. 

The software implementation may be incorporated within 
another application program. In one embodiment, for 
example, the instructions may be included as part of the 
browser. In another embodiment, the instructions may be 
part of the operating system. The software itself may be 
stored or distributed on any variety of storage media includ- 
ing optical disks, magnetic disks, tapes, etc. 

The methods described above are also applicable to 
"intranets." An intranet is typically considered to be a 
network used within a business entity such as a corporation. 
Although the intranet is usually intended for internal use 
only, the business organization may permit the outside 
Internet community to access all or a part of the intranet. 

An intranet uses the same communication protocols and 
tools as the Internet. Thus in one embodiment an intranet 
supports TCP/IP and Internet tools on an organization's 
local area network (LAN) or wide-area network (WAN). 
With reference to FIG. 1, LAN 180 is an intranet in one 
embodiment. The intranet offers the ability to leverage the 
availability all of the tools and applications available for the 
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Internet for the business organization's exclusive use with 
its own network system. 

Thus, for example, different divisions within a corporate 
entity can create their own web pages to announce employ- 

5 ment opportunities, new product offerings, etc. The intranet 
web page might provide a front-end interface to other 
information systems within the organization. This could 
permit, for example, a finance or accounting division to 
retrieve accounting information stored in a mainframe by 

10 using the web page as a front-end for accessing a mainframe 
database. The use of the caching methods described above 
may tend to improve response time at each client computer. 
In addition, caching at each client computer tends to reduce 
overall intranet traffic, thus leaving even more resources 

1 5 available for handling additional clients or further improving 
the responsiveness to intranet client requests. 

In the preceding detailed description, the invention is 
described with reference to specific exemplary embodiments 
thereof. Various modifications and changes may be made 

20 thereto without departing from the broader spirit and scope 
of the invention as set forth in the claims. The specification 
and drawings are, accordingly, to be regarded in an illus- 
trative rather than a restrictive sense. 
What is claimed is: 

25 1. An apparatus comprising: 

storage media for containing caching logic for caching 
web resources, wherein the caching logic includes 
instructions to cache selected web resources in accor- 

30 dance with a length of time spent on a selected web 
resource wherein the selected web resource corre- 
sponds to a subset of a plurality of accessed resources 
and wherein an accessed web resource is selected for 
caching if no other web resource is accessed after a 

35 predetermined period of time. 

2. The apparatus of claim 1 further comprising: 

a cache, wherein the caching logic includes instructions to 
retrieve a cached web resource from the cache in 
response to an access request for a previously accessed 
40 web resource, if the cached web resource corresponds 
to the previously accessed web resource. 

3. The apparatus of claim 1 wherein the instructions form 
a portion of at least one of an operating system and a 
browser, 

45 

* * * * * 
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